MAJOR ARTICLE 


The Large 386-nt Deletion in SARS-Assodated 
Coronavirus: Evidence for Quasispecies? 


Julian W. Tang , 1 Jo L. K. Cheung , 1 Ida M. T. Chu , 1 Joseph J. Y. Sung , 23 Malik Peiris , 4 and Paul K. S. Chan 13 

Departments of ’Microbiology and z Medicine and Therapeutics, Centre for Emerging Infectious Diseases, and 3 School of Public Health, 
The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, and ’Department of Microbiology, The University of Hong Kong, 
Queen Mary Hospital, Pokfulam, Hong Kong SAR, China 


The severe acute respiratory syndrome-associated coronavirus (SARS-CoV) is reported to have deletions of 
various sizes. Recently, the large 386-nucleotide deletion (L386del) comprising nucleotide positions 27719- 
28104 and spanning open reading frames 9-11 has been reported in the genomes of some human isolates 
from Hong Kong. In this study, archived specimens from 71 patients with SARS who were admitted to the 
New Territory East Cluster Hospitals in Hong Kong were analyzed to determine whether the L386del variant 
of SARS-CoV was present. There was no clear relationship between the presence of the L386del variant and 
SARS clinical severity as defined either by the need for intensive-care therapy and/or ventilation or by death. 
One patient had evidence of both the L386del variant and the wild-type variant in the same clinical specimen, 
supporting the idea that SARS-CoV exists as a quasispecies in some patients, although the clinical significance 
of these quasispecies remains unclear. 


Since the worldwide SARS epidemic in 2003, many 
studies have focused on the genomics of the SARS- 
associated coronavirus (SARS-CoV), to understand more 
about the pathogenic potential of this new pathogen. 
Several significant differences between the various SARS- 
CoV isolates from different patients and animal hosts 
have now been identified. Guan et al. [ 1 ] found that the 
majority of human SARS-CoV associated with the epi¬ 
demic had a 29-nt deletion spanning open reading frames 
(ORFs) 10 and 11 (in Guan et al.’s original nomencla¬ 
ture), compared with the SARS-like coronavirus (SL- 
CoV) identified in Himalayan palm civet cats (Paguma 
larvata) from a Guangdong live-animal market; this 29- 
nt deletion in the human SARS-CoV genome comprised 
nucleotide positions (np) 27869-27897 when that ge- 
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nome was aligned with the TOR-2 SARS-CoV reference 
sequence (GenBank accession no. NC004718). The func¬ 
tional significance of this deletion is uncertain, although 
it may represent an adaptive mutation to a human host 
by SARS-CoV from a potential civet-cat animal res¬ 
ervoir [1, 2]. However, it has been reported that the 
human-adapted SARS-CoV with the 29-nt deletion re¬ 
mains just as able to infect the civet cat [3]. Other 
studies have suggested that bats may be an alternative 
animal reservoir for SARS-CoV [4, 5], which suggests 
that the civet cat is a secondary, amplifying host for 
SARS-CoV rather than the primary reservoir. The bat 
SL-CoV also has the additional 29 nt in ORFs 10 and 
11 and, in this respect, is similar to the civet-cat SL- 
CoV [4, 5]. Most recently, experiments have shown that 
replacement of this 29-nt deletion does not alter the in 
vitro or in vivo replication, persistence, or duration of 
the viral load in the murine model, although this find¬ 
ing does not exclude the possibility that there could 
be more-dramatic effects if a civet-cat or raccoon-dog 
model were used [6]. 

Other deletions have been noted in other areas of 
the SARS-CoV genome. An 82-nt deletion in ORF 8 
has been reported in a study [7] whose authors also 
suggested that the earliest phase of the SARS epidemic 
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Figure 1 . Location, in the SARS-associated coronavirus genome, of the large 386-nt deletion (L386del), in relation to open reading frames (ORFs) 
9-11. PCR, polymerase chain reaction. (N.B.: the sizes of the bars are for illustrative purposes only and are not to scale.) 


(i.e., when SARS was confined to China) were characterized by 
2 major SARS-CoV genotypes, one of them characterized by 
the 29-nt deletion and the other harboring the 82-nt deletion. 
These 2 genotypes had also been isolated from animals in 
Guangdong and other provinces (e.g., Hubei) in China. The 
authors of that study also noted that these 2 genotypes were 
not restricted to the early part of the epidemic but were isolated 
during the middle phase as well; in addition, they reported the 
presence, during the late phase of the epidemic, of a third 
genotype, characterized by a 415-nt deletion resulting in the 
loss of ORF 8. They speculated that ORF 8 may have either 
no or an unimportant coding function, because the virus’s life- 
cycle can be completed even if ORF 8 is absent [7]. 

Even within the human population, it seems that SARS-CoV 
has continued to adapt during the course of the epidemic. A 
report from Hong Kong has documented the presence, in some 
human SARS-CoV genomes, of a large 386-nt deletion (L386del) 
comprising np 27719-28104 [8]. This larger deletion flanks the 
29-nt deletion segment described above, disrupting ORF 9— 
and, simultaneously, eliminating ORFs 10 and 11—of the SARS- 
CoV genome. So far, there have been no other reports of this 
large genomic deletion. 

The present study reports additional data on the presence 
of the L386del variant of SARS-CoV in other Hong Kong Chi¬ 
nese patients during the SARS epidemic in 2003. In addition, 
it reports 1 patient from whom multiple clinical specimens were 
taken that had both the L386del variant and the wild-type 
variant, indicating that the 2 variants can coexist not only 
within a single host but also within the same clinical specimen. 


The present study also assesses whether this deletion is asso¬ 
ciated with more-severe SARS, which is clinically defined either 
as requiring admission to intensive care (IC) and/or ventilation 
or as resulting in death. 

PATIENTS, MATERIALS, AND METHODS 

The clinical specimens used in the present study were obtained 
from patients with laboratory-confirmed SARS-CoV infection 
[9] who had been admitted to the New Territory East Cluster 
Hospitals during the Hong Kong SARS epidemic, which oc¬ 
curred in March-June 2003. All clinical specimens that had 
tested positive for SARS-CoV RNA during the routine inves¬ 
tigation were retrieved for the present study, provided that 
sufficient remaining volumes were available; if the original clin¬ 
ical specimens were unavailable, their primary isolates, obtained 
from Vero cell culture, were used. A pair of specific primers, 
spanning np 26980-28218 (1239 bp), were designed for poly¬ 
merase chain reaction (PCR) used to amplify the L386del region 
(figure 1); this region spans the entire length of ORFs 9-11, 
which are known to have—and to be disrupted by—this de¬ 
letion [8]. All PCR products that gel electrophoresis revealed 
as having a size smaller than the 1239 bp expected for the wild 
type were then sequenced, to map the deletion site. 

RESULTS 

The present study included 71 patients (age range, 15-98 years; 
mean ± SD, 54.6 ± 21.2 years), 25 of whom were male. A 
total of 97 original specimens/primary isolates were available 
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Table 1. Proportions of specimens and patients with the 
large 386-nt deletion (L386del) variant of SARS-associated 
coronavirus. 


Source 

Specimens 
with L386del 

Patients with 
L386del in any 
specimen 

Respiratory tract 0 

16/64 (25.0) 

13/59 (22.0) 

Gastroenterological tract b 

16/24 (66.7) 

9/13 (69.2) 

Urine 

2/4 (50.0) 

2/4 (50.0) 

Tissue 0 

2/5 (40.0) 

2/5 (40.0) 

Total 

36/97 (37.1) 

22 d /71 e (31.0) 


NOTE. Data are proportion (%). 

a Includes nasopharyngeal aspirate, throat and nasal swabs, and 
tracheal aspirate. 

b Includes stool and rectal swabs. 

c Comprises 4 biopsy specimens from lung and 1 biopsy specimen 
from terminal ileum. 

d Of these 22 patients, 4 had >1 type of specimen with the L386del 
variant. 

e Of these 71 patients, 10 had >1 specimen tested. 

from these 71 patients: only 1 specimen was available from 
each of 61 (85.9%) of these patients, 2 specimens were available 
from each of 4 (5.6%) of these patients, and ^3 specimens 
were available from each of 6 (8.5%) of these patients. The 
distribution of specimen types is shown in table 1; the majori¬ 


ty of the specimens studied were from the respiratory tract 
(66.0%) or stool/rectal swabs (24.7%). 

A total of 36 (37.1%) of the 97 specimens had a PCR product 
with a size less than that expected for the wild-type variant. 
The sequencing results for these PCR products showed that they 
all had a 386-bp deletion corresponding to np 27719-28104 of 
the TOR-2 SARS-CoV reference sequence. All these sequences 
were submitted to GenBank (accession nos. DQ523267- 
DQ523302). The proportion of stool/rectal swab specimens 
with the L386del variant was significantly higher than that of 
respiratory specimens (66.7% vs. 25.0%, respectively [P< 
.001, by x 2 test]); the number of urine specimens was too few 
to allow for similar statistical analysis. In the patients, the dis¬ 
tribution of variants was as follows: 49 (69.0%) of the 71 pa¬ 
tients had only the wild-type variant, 20 (28.2%) of them had 
only the L386del variant, and 2 (2.8%) of them had both 
variants. 

Of the 71 patients studied, 36 (50.7%) had “mild” SARS 
illness (clinically defined either as not requiring admission to 
IC and/or ventilation or as recovered), and 8 (22.2%) of these 
36 patients had the L386del variant; the remaining 35 (49.3%) 
of these 71 patients had “severe” SARS illness (clinically defined 
either as requiring admission to IC and/or ventilation or as 
resulting in death), and in 14 (40.0%) of these 35 patients >1 
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Figure 2. Presence of the large 386-nt deletion (L386del) variant of SARS-associated coronavirus, in relation to date at onset of fever. Each diamond 
represents 1 patient examined in the present study; the diamonds are positioned according to the date at onset of fever, as indicated on the x-axis. 
Patients with the L386del variant (black diamonds) are clustered toward the upper part of the figure; patients without it (white diamonds) are clustered 
toward the lower portion of the figure. 
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Table 2. Distribution of the large 386-nt deletion (L386del) variant of SARS-associated coronavirus (SARS-CoV), in patients from 
whom >3 specimens were available for study. 


Patient 

Sex 

(age, years) 

Date at 
onset of 
fever, 2003 

Clinical outcome 

Specimens positive for SARS-CoV a 
(specimens with L386del [collection dates, 20031) 


Intensive care/ 
ventilation 6 

Death 

Respiratory tract 

Gastrointestinal tract 

Urine 

A 

F (15) 

22 April 

No 

No 

1 (1 [26 April 031) 

2 (2 [29 April and 2 May]) 


B 

F (84) 

28 April 

No 

No 


2 (0) 

1 (0) 

C 

M (68) 

7 May 

No 

No 


4 (4 [19, 22, 25, and 27 May]) 


D 

F (43) 

12 May 

Yes 

No 

5 (0) 

1 (0) 


E 

F (59) 

15 May 

Yes 

Yes 

1 (0) 

3 (1 [11 June]) 


F 

F (98) 

31 May 

No 

Yes 

5 (4 [1, 5, 6, and 9 Jun]) 

3 (2 [6 and 9 June]) 



a Either the L386del variant or the wild-type variant. 
b Patient required admission to intensive care and/or ventilation. 


specimen contained the L386del variant. Although a higher 
proportion of patients with severe SARS illness were found to 
have the L386del variant, the difference was not statistically 
significant (P = .1, by x 2 test). 

The correlation between the L386del variant and the time at 
onset of SARS is shown in figure 2. The L386del variant ap¬ 
peared at the end of March 2003, immediately after the major 
community (i.e., Amoy Gardens) outbreak, and then cocir¬ 
culated with the wild-type variant and became predominant 
toward the end of the outbreak in Hong Kong. 

All 4 (100%) of the patients from whom 2 specimens each 
were available for study had the L386del variant, and it was 
present in all 8 specimens. The characteristics, specimen dis¬ 
tribution, and results of the investigation of the 6 patients (pa¬ 
tients A-F) from whom &3 specimens were available for study 
are shown in table 2. In patient F, both the L386del variant 
and the wild-type variant were detected in the same specimen 
and also in specimens taken from different sites during the 
same day (table 3 and figure 3). 

DISCUSSION 

The present study used original clinical specimens or, if the 
latter were unavailable, primary isolates of SARS-CoV. Because 
Poon et al. [10] had documented some single-nucleotide poly¬ 
morphisms in the SARS-CoV genome of cultured specimens, 
we also tested the effect that in vitro passage in Vero cells (which 
had been used in the culturing of the original clinical specimen) 
might have on the emergence of the L386del variant. No emer¬ 
gence of the L386del variant was observed; therefore, the pres¬ 
ent study’s results regarding these primary SARS-CoV iso¬ 
lates are likely to reflect the situation in the original clinical 
specimens. 

The results of the present study show that the L386del variant 
is more likely to be found in specimens from the gastrointestinal 
tract than in those from the respiratory tract. It has been shown 
that both the lungs and gastrointestinal tract are the predom¬ 


inant sites for SARS-CoV tropism [11], and high viral loads 
have been documented in both tissue types [12]. However, the 
duration of shedding from the 2 sites seems to be different: 
analysis based on the diagnostic yield of specimens taken at 
different times [9] has shown that virus shedding from the 
gastrointestinal tract persists longer than does that from the 
respiratory tract. This longer duration of infection might favor 
the generation of mutants, including the L386del variant in¬ 
vestigated in the present study. 

At present, the clinical implications of the L386del variant 
remain uncertain. Although the present study has shown that 
a higher proportion (49.3% vs. 22.2%) of patients with severe 
SARS infection have the L386del variant, the difference is not 
statistically significant (P = .1, by x 2 test). Furthermore, be¬ 
cause of the retrospective nature of the present study, it was 
not able to control for all possible confounding factors, such 
as variation in the number and nature of examined specimens 
from the patients, the timing of the collection of the specimens, 
and the treatment regimen used. 


Table 3. Distribution of the large 386-nt deletion (L386del) var¬ 
iant and the wild-type variant of SARS-associated coronavirus 
(SARS-CoV), in serial specimens collected from patient F. 


Specimen- 
collection 
date in 2003 a 

Source 

Days after 
onset of fever 
(31 May) 

SARS-CoV variant 

Wild type 

L386del 

31 May (A) 

Nasal swab 

0 

Present 

Absent 

1 June (B) 

Nasal swab 

1 

Present 

Present 

5 June (C) 

Nasal swab 

5 

Absent 

Present 

6 June (D) 

Stool 

6 

Present 

Present 

6 June (E) 

Nasal swab 

6 

Absent 

Present 

6 June (F) 

Stool 

6 

Present 

Absent 

9 June (G) 

Nasal swab 

9 

Absent 

Present 

9 June (H) 

Stool 

9 

Absent 

Present 


NOTE. All results are from original specimens. 

a Letters in parentheses are those used to label the lanes in the gel-elec- 
trophoresis image shown in figure 3. 
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Figure 3. Gel-electrophoresis imaging of products of polymerase chain reaction (PCR) using primers specific for the large 386-nt deletion (L386del) 
in SARS-associated coronavirus. Lanes are labeled with specimen type and collection date (nos. in parentheses are no. of days after onset of fever). 
Lane M, marker; lanes A and F, wild-type variant only; lanes C, E, G, and H, L386del variant only; lanes B and D, mixture of the L386del variant and 
the wild-type variant. PCR-product sizes for the L386del variant and the wild-type variant are 853 bp and 1239 bp, respectively. 


At a population level, the L386del variant appeared soon 
after the SARS outbreak in Hong Kong and became dominant 
toward the end of this outbreak. This observation is in keeping 
with the hypothesis that the L386del variant has adapted to 
human infection and is being transmitted as efficiently as—or 
even more efficiently than—the wild-type variant. 

The present study has shown that, at any given time, both 
the L386del variant and the wild-type variant can be present 
within the same clinical specimen of a patient, an example being 
patient F in the present study; and this finding provides evi¬ 
dence that SARS-CoV evolves rapidly within the same host 
and, therefore, may exist as a quasispecies. The results observed 
in patient F are also consistent with what has been observed 
at the population level, where the L386del variant was found 
to have appeared during the course of the SARS outbreak and 
to have become dominant toward the end of this outbreak. 

The L386del variant disrupts or completely eliminates ORFs 
9 (np 27638-27772), 10 (np 27779-27898), and 11 (np 27864- 
28118). The functions of these ORFs remain unknown. An 
earlier report on the SARS-CoV genome [13] predicted that 
ORFs 9, 10, and 11 would code for proteins comprising 44, 
39, and 84 aa, respectively. That report also noted that ORF 9 
was comparable to a putative sterol-C5 desaturase and a Clos¬ 
tridium perfringens protein but that ORF 10 was not comparable 
to any known proteins listed in public databases. The structure 


predicted for ORFs 9 and 10 was a single transmembrane helix, 
and that for ORF 11 was a soluble protein; the latter showed 
a slight match to the human coronavirus S glycoprotein pre¬ 
cursor [13]. More recently, Li et al. [14] have suggested that 
ORF 10 may play a role by impairing the oxidoreductase system 
in the mitochondria of SARS-CoV-infected host cells, and 
those authors therefore have hypothesized that the clinical ef¬ 
fects of SARS-CoV infection in humans may be due to a direct 
cytopathic effect of the virus, rather than to virus-induced im- 
munopathological damage. A study of 33 complete SARS-CoV 
genomes [15] found that ORF 10 had the highest point-mu¬ 
tation rate, a rate that was higher than that in parts of the 
replicase, S, and N genes; because this high mutation rate has 
also been reported for a similar region (ORF 10') in the SL- 
CoVs recently isolated from 3 species of horseshoe bats (genus 
Rhinolophus) [4], ORF 10 seems to be present in human and 
other mammalian SARS-CoV and SL-CoV species, although its 
function remains unknown. In both the human and civet-cat 
host, it is unclear whether the reported deletions in the various 
ORFs of the SARS-CoV genome have any significant clinical 
consequence; however, at least 1 of them, the 29-nt deletion, 
was considered to be a marker of a human-host origin when 
it was found in SARS-CoV isolated from a pig [16]. Therefore, 
at present, the region covered by ORFs 10 and 11 would seem 
to be useful as a genetic marker of a possible animal origin 
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of SARS-CoV, rather than as a therapeutic or vaccine target— 
in contrast to its structural counterparts, such as the S gene 
(ORF 2) [17]. 
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